Diffusion of Context and Credit Information in Markovian

Author

  • Yoshua Bengio
Abstract

This paper studies the problem of ergodicity of transition probability matrices in Markovian models, such as hidden Markov models (HMMs), and how it makes the task of learning to represent long-term context for sequential data very difficult. This phenomenon hurts the forward propagation of long-term context information, as well as the learning of a hidden state representation for long-term context, which depends on propagating credit information backwards in time. Using results from Markov chain theory, we show that this problem of diffusion of context and credit is reduced when the transition probabilities approach 0 or 1, i.e., when the transition probability matrices are sparse and the model is essentially deterministic. The results found in this paper apply to learning approaches based on continuous optimization, such as gradient descent and the Baum-Welch algorithm.
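The diffusion effect described in the abstract can be illustrated numerically. The following is a minimal sketch, not the paper's own experiment: it assumes a hypothetical two-state symmetric chain with switching probability `eps`, and the helper names (`two_state_chain`, `matrix_power`, `row_spread`) are illustrative only. It compares how quickly the rows of the t-step transition matrix become identical, which is when the state at time t no longer carries information about the starting state.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def matrix_power(a, t):
    """Return the t-step transition matrix a**t by repeated multiplication."""
    n = len(a)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(t):
        result = matmul(result, a)
    return result


def row_spread(m):
    """Largest gap between two entries of the same column.

    A spread near 0 means every row is (almost) the same distribution,
    so the starting state has been forgotten.
    """
    return max(max(col) - min(col) for col in zip(*m))


def two_state_chain(eps):
    """Hypothetical chain: stay with probability 1 - eps, switch with eps."""
    return [[1.0 - eps, eps], [eps, 1.0 - eps]]


diffuse = two_state_chain(0.3)    # probabilities far from 0 and 1
sparse = two_state_chain(0.001)   # nearly deterministic transitions

for t in (1, 10, 50):
    print(t,
          row_spread(matrix_power(diffuse, t)),
          row_spread(matrix_power(sparse, t)))
```

For this particular chain the spread equals (1 - 2·eps)^t, so it decays geometrically when `eps` is far from 0 and stays close to 1 over the same horizon when the transitions are nearly deterministic. This is consistent with the abstract's claim, though only for this toy case.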


Similar resources

Diffusion of Credit in Markovian Models

This paper studies the problem of diffusion in Markovian models, such as hidden Markov models (HMMs), and how it makes the task of learning long-term dependencies in sequences very difficult. Using results from Markov chain theory, we show that the problem of diffusion is reduced if the transition probabilities approach 0 or 1. Under this condition, standard HMMs have very limited modeling capabilities, but i...


Diffusion of Credit in Markovian

This paper studies the problem of diffusion in Markovian models, such as hidden Markov models (HMMs), and how it makes the task of learning long-term dependencies in sequences very difficult. Using results from Markov chain theory, we show that the problem of diffusion is reduced if the transition probabilities approach 0 or 1. Under this condition, standard HMMs have very limited modeling capabi...


QMW-PH-96-17 Linear quantum state diffusion for non-Markovian open quantum systems

We demonstrate the relevance of complex Gaussian stochastic processes to the stochastic state vector description of non-Markovian open quantum systems. These processes express the general Feynman-Vernon path integral propagator for open quantum systems as the classical ensemble average over stochastic pure state propagators in a natural way. They are the coloured generalization of complex Wiener...


Stochastic Differential Games in a Non-Markovian Setting

Stochastic differential games are considered in a non-Markovian setting. Typically, in stochastic differential games the modulating process of the diffusion equation describing the state flow is taken to be Markovian. Then Nash equilibria or other types of solution such as Pareto equilibria are constructed using Hamilton-Jacobi-Bellman (HJB) equations. But in a non-Markovian setting the HJB method i...


LAMN property for integrated diffusions

In this paper we prove the Local Asymptotic Mixed Normality (LAMN) property for the statistical model given by the observation of local means of a diffusion process X. Our data are given by $\int_0^1 X_{(s+i)/n}\,d\mu(s)$ for i = 0, ..., n−1, and the unknown parameter appears in the diffusion coefficient of the process X only. Although the data are neither Markovian nor Gaussian, we can write down, with help o...




Publication date: 1995